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Empirical Baye's Method and Tests in Very Light Quark Range from The 
Overlap Lattice QCD 
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Based on Bayesian theorem an empirical Baye's method is discussed. A programming chart for mass spectrum 
fitting is suggested. A weakly constrained way for getting priors to solve the chiral log data fitting singularity is 
tested. 



1. Introduction 

Data analysis is an important procedure for 
most numerical projects and experiments. In lat- 
tice QCD, the theory we fit to hadronic two point 
correlation function is: 



G{t,A,m) 



-mi{t-to) 



(1) 



However, the general minimum procedure does 
not work for such a full physics hypothesis as the 
procedure is singular. What we used to do is to 
truncate both data set and theory. We might fit 
only the large-i behavior with the lowest mass 
particle. 

Numerically, the minimizing procedure is 
equivalent to a linear equation. Suppose p = 
{Ai,mi} is an unknown parameter vector 



minimize : A{p) 
I A-p-c 1=0 



(2) 

A singularity can occur due to the fact that 
matrix A has is degenerate or has zero eigenval- 
ues. A possible solution to the singularity is to 
add another matrix B. Suppose there is another 
minimizing procedure B{p), 

C{p) = A{p) + \B{p) (3) 

^ |C-p-c'|=0 

We can minimize C{p) instead of A{p). 



2. Bayesian Theory 

Bayesian statistics provides a useful way to of- 
fer such an additional minimizing procedure. The 
parameter vector p describes the hypothesis of 
Eq. (1). To get p from the measured data set 
D is a numerical inverse problem. The Bayesian 
theorem tells us that j^: a single good inverse p, 
is to maximize the probability 

Prob(p I D,p) = Prob(D | p,p) ■ (4) 

Here p is a prior set of the hypothesis and Prob(p | 
p) is the Bayesian prior probability. 

Maximizing the entropy of p under (p) = p, 

{P') - {P? 



ar^ gives: 
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Probp I p, p) 
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(5) 
(6) 



So the Bayesian theorem tells us that a single 
good inverse p is to minimize x\ug @] • Here the 
Xprior pl^y^ the role of the additional minimizing 
term B{p) in Eq. (3) with A = 1. 



3. Systematic errors and multiplier A 

In principle, minimizing A{p) and minimizing 
B{p) would not necessarily give the same solution 
of p. Hereby, multiplier vector A is a bridge from 
the solution of minimizing A to the solution of 
minimizing B. With A = 1 the constrained data 
modeling always introduces some systematic bias 
which depends on how the input priors match the 
unknown hypothesis. In some of the cases if we 
know the physics very well, we can "teach the 
physics in fitting procedure" . That means to in- 
put the priors according to our knowledge to the 
hypothesis ||]. However, in some of the cases we 
do not know the hypothesis well, only the mea- 
sured data set D can tell us the information both 
of the priors and hypothesis. We should not sim- 
ply maximize the probability Prob(D | p) to get 
the priors p, since that violates Bayesian theo- 
rem. Instead, we can consider to use a subset of 
the measured data. In Bayesian statistics when 
some data set Di comes along, and then some ad- 
ditional data set D2 comes again, the probability 
of p in these two cases will be m: 



minimize Xaug 



T 



Prob(p I =Prob(p I p) 



PTohjD, I p,p) 
Prob(i:>i I p) 



Prob(Hi^.,p)^Prob(Hp)- ^;"'^^'flf (7) 

Prob(£)2 I p) 

One can then prove the estimate of p probability 
in a enlarged data set: 

Proh{p\ D2Di,p) 
^ Prob(H P) . ^^^^(^H^i-^, (8) 

which shows that we would have the same answer 
if all the data D1D2 had been taken together. 
Furthermore, we can get the priors from data set 
Di then to fit p from D1D2 taken together. That 
will not violate the Bayesian theorem. So we can 
construct a real empirical Baye's method to make 
data modeling. All the information comes from 
measured data set, without any additional artifi- 
cial bias. For example, we can give a program- 
ming chart for the mass spectrum fitting such as 
in Fig. 1. 

We test this empirical Baye's method 011 a 
16"^ X 28 lattice, with j3 = 2.264 Iwasaki gauge ac- 



D^D+I, \n,n+i = 
A,<]v = 1 minimize 




yes 



N = N^a,, D^D-fM, A, = 
Ai,2 — minimize Xaug 

] 

out 

Figure 1. A programming chart of the empirical 
Baye's method. Where J 7^ 0, M / 0, A is the 
number of the parameters in each fitting procedure, 
Nmax is the maximum number of the parameters we 
want to fit, D is the number of the data points we use 
to fit the parameters. A,; is ith element of the multiple 
vector A 



tion. Quenched approximation with anti-periodic 
boundary in t-direction is used. scale gives 
a^^ = 0.9775(47)GeV. The lowest pion mass is 
about sa 179A/IeV, mT^/mp « 0.25. Empirical 
Baye's method works in such light quark area. 
Fig. 2 shows the A test at « 202MeV. Where 
whole elements of vector A are equal to 1 , except 
the first and second elements Ai, A2 are varying 
and Ai = A2. This test shows that the pion mass 
is stable, the priors were obtained from data sub- 
set matches full data set very well [Q. 

An other advantage of this empirical Baye's 
method is that sometimes we can observe a stable 
excited state. Fig. 3 shows the nucleon mass and 
the mass of the first excited state. The mass ratio 
shows that the excited state is consistent with the 
nucleon Roper state which lattice community has 
searched for years but have not been successful 
before 

1.54 ±0.17 



3 



TT mass vs A with 



0.01876 



0.22 
0.21.3 - 
0.21 
- 0.20.3 




0.5 1 1.5 2 

Figure 2. The A test for pion mass at m„ 
202MeV 
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Figure 3. The Roper state as an excited state of 
the nucleon 



data modehng to get the good priors. In this case 
we input S = 0.20 ± 0.20, A,^ = 1.1 ± 0.5 as the 
weak priors. From a data subset of m^r = 179MeV 
to rriTr = 287MeV to get the better priors. We 
then input this better priors into the full data set 
of rrij, = 179MeV to = 312MeV, and obtain 
S = 0.26 ± 0.03, A = 1.1 ± O.lGeV. Fig. 4 shows 
the resultant fit of {m.„aY . 



IG"* X 28 lattice Quenched chiral log 
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Figure 4. The weak constrained data modeling 
for the quenched chiral log 



Further work is on the way to improve the sta- 
bility of the fitting program, especially in the very 
light quark region, and try to automate the pro- 
cedure. 



4. Quenched chiral log fitting and the weak 
prior method 

The quenched chiral log is a hard problem, 
since the general minimum data modeling for 
the chiral log formula is singular §0. The nor- 
mal equation in the fitting procedure gives only 
2 independent parameters, the third one is not 
independent. 

mis = 2Amq{l-5[\n{2AmJKl)+l]}+0{ml){%) 

An additional matrix is really needed to lift the 
degeneracy in order to get the priors from data. 
An alternative way is to use weak constrained 
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